Tag
9 articles
Canadian AI company Cohere has open-sourced its most powerful language model to date, Command A+, under the Apache 2.0 license. This move aims to foster collaboration and accessibility in the AI community.
German AI translation company DeepL is cutting 250 jobs as it restructures to become an 'AI-native' organization, focusing on a new specialized large language model for translation and editing.
Researchers have developed Talkie-1930, a 13-billion-parameter language model trained exclusively on pre-1931 English texts, to study historical reasoning and generalization.
A 13-billion-parameter language model trained only on texts before 1931 imagines 2026 as a world of steamships and penny novels, highlighting the risks of AI systems trained on outdated data.
OpenAI releases comprehensive system card for GPT-5.5, detailing enhanced capabilities and safety measures in the advanced language model.
Zhipu AI's new GLM-5.1 model can refine its own coding strategy across hundreds of iterations, marking a major advancement in AI-driven software development.
Explore the significance of Hugging Face's TRL v1.0, a unified framework for aligning large language models through post-training techniques like SFT, Reward Modeling, DPO, and GRPO.
Learn to implement and use State Space Models with the Mamba architecture, focusing on Mamba-3's 2x smaller states and enhanced hardware efficiency.
Inception has launched Mercury 2, the first diffusion-based language reasoning model that processes entire passages in parallel, making it more than five times faster than traditional models.